NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Privacy-Preserving Logging and Recovery for Growing Database

Fanelle, Maria; Bater, Johes (January 2025, North East Database Day)

Full Text Available
SPECIAL: SynoPsis AssistEd Secure Collaborative AnaLytics

https://doi.org/10.14778/3717755.3717764

Wang, Chenghong; Qiu, Lina; Bater, Johes; Luo, Yukui (December 2024, Proceedings of the VLDB Endowment)

Secure collaborative analytics (SCA) enables the processing of analytical SQL queries across data from multiple owners, even when direct data sharing is not possible. While traditional SCA provides strong privacy through data-oblivious methods, the significant overhead has limited its practical use. Recent SCA variants that allow controlled leakages under differential privacy (DP) strike balance between privacy and efficiency but still face challenges like unbounded privacy loss, costly execution plan, and lossy processing. To address these challenges, we introduce SPECIAL, the first SCA system that simultaneously ensures bounded privacy loss, advanced query planning, and lossless processing. SPECIAL employs a novelsynopsis-assisted secure processing model, where a one-time privacy cost is used to generate private synopses from owner data. These synopses enable SPECIAL to estimate compaction sizes for secure operations (e.g., filter, join) and index encrypted data without additional privacy loss. These estimates and indexes can be prepared before runtime, enabling efficient query planning and accurate cost estimations. By leveraging one-sided noise mechanisms and private upper bound techniques, SPECIAL guarantees lossless processing for complex queries (e.g., multi-join). Our comprehensive benchmarks demonstrate that SPECIAL outperforms state-of-the-art SCAs, with up to 80× faster query times, 900× smaller memory usage for complex queries, and up to 89× reduced privacy loss in continual processing.
more » « less
Full Text Available
Differentially Private Query Optimization

Alam, Sara; Wang, Chenghong; Bater, Johes (May 2024, North East Database Day)

Full Text Available
Longshot: Indexing Growing Databases Using MPC and Differential Privacy

https://doi.org/10.14778/3594512.3594529

Zhang, Yanping; Bater, Johes; Nayak, Kartik; Machanavajjhala, Ashwin (April 2023, Proceedings of the VLDB Endowment)

In this work, we propose Longshot, a novel design for secure outsourced database systems that supports ad-hoc queries through the use of secure multi-party computation and differential privacy. By combining these two techniques, we build and maintain data structures (i.e., synopses, indexes, and stores) that improve query execution efficiency while maintaining strong privacy and security guarantees. As new data records are uploaded by data owners, these data structures are continually updated by Longshot using novel algorithms that leverage bounded information leakage to minimize the use of expensive cryptographic protocols. Furthermore, Long-shot organizes the data structures as a hierarchical tree based on when the update occurred, allowing for update strategies that provide logarithmic error over time. Through this approach, Longshot introduces a tunable three-way trade-off between privacy, accuracy, and efficiency. Our experimental results confirm that our optimizations are not only asymptotic improvements but also observable in practice. In particular, we see a 5x efficiency improvement to update our data structures even when the number of updates is less than 200. Moreover, the data structures significantly improve query runtimes over time, about ~10³x faster compared to the baseline after 20 updates.
more » « less
Full Text Available
IncShrink: Architecting Efficient Outsourced Databases using Incremental MPC and Differential Privacy

https://doi.org/10.1145/3514221.3526151

Wang, Chenghong; Bater, Johes; Nayak, Kartik; Machanavajjhala, Ashwin (June 2022, SIGMOD '22: Proceedings of the 2022 International Conference on Management of Data)

In this paper, we consider secure outsourced growing databases (SOGDB) that support view-based query answering. These databases allow untrusted servers to privately maintain a materialized view. This allows servers to use only the materialized view for query processing instead of accessing the original data from which the view was derived. To tackle this, we devise a novel view-based SOGDB framework, Incshrink. The key features of this solution are: (i) Incshrink maintains the view using incremental MPC operators which eliminates the need for a trusted third party upfront, and (ii) to ensure high performance, Incshrink guarantees that the leakage satisfies DP in the presence of updates. To the best of our knowledge, there are no existing systems that have these properties. We demonstrate Incshrink's practical feasibility in terms of efficiency and accuracy with extensive experiments on real-world datasets and the TPC-ds benchmark. The evaluation results show that Incshrink provides a 3-way trade-off in terms of privacy, accuracy and efficiency, and offers at least a 7,800x performance advantage over standard SOGDB that do not support view-based query paradigm.
more » « less
Full Text Available
Visualizing Privacy-Utility Trade-Offs in Differentially Private Data Releases

https://doi.org/10.2478/popets-2022-0058

Nanayakkara, Priyanka; Bater, Johes; He, Xi; Hullman, Jessica; Rogers, Jennie (March 2022, Proceedings on Privacy Enhancing Technologies)

Abstract Organizations often collect private data and release aggregate statistics for the public’s benefit. If no steps toward preserving privacy are taken, adversaries may use released statistics to deduce unauthorized information about the individuals described in the private dataset. Differentially private algorithms address this challenge by slightly perturbing underlying statistics with noise, thereby mathematically limiting the amount of information that may be deduced from each data release. Properly calibrating these algorithms—and in turn the disclosure risk for people described in the dataset—requires a data curator to choose a value for a privacy budget parameter, ɛ . However, there is little formal guidance for choosing ɛ , a task that requires reasoning about the probabilistic privacy–utility tradeoff. Furthermore, choosing ɛ in the context of statistical inference requires reasoning about accuracy trade-offs in the presence of both measurement error and differential privacy (DP) noise. We present Vi sualizing P rivacy (ViP), an interactive interface that visualizes relationships between ɛ , accuracy, and disclosure risk to support setting and splitting ɛ among queries. As a user adjusts ɛ , ViP dynamically updates visualizations depicting expected accuracy and risk. ViP also has an inference setting, allowing a user to reason about the impact of DP noise on statistical inferences. Finally, we present results of a study where 16 research practitioners with little to no DP background completed a set of tasks related to setting ɛ using both ViP and a control. We find that ViP helps participants more correctly answer questions related to judging the probability of where a DP-noised release is likely to fall and comparing between DP-noised and non-private confidence intervals.
more » « less
Full Text Available
DP-Sync: Hiding Update Patterns in Secure Outsourced Databases with Differential Privacy

https://doi.org/10.1145/3448016.3457306

Wang, Chenghong; Bater, Johes; Nayak, Kartik; Machanavajjhala, Ashwin (June 2021, SIGMOD '21: Proceedings of the 2021 International Conference on Management of Data)

In this paper, we consider privacy-preserving update strategies for secure outsourced growing databases. Such databases allow appendonly data updates on the outsourced data structure while analysis is ongoing. Despite a plethora of solutions to securely outsource database computation, existing techniques do not consider the information that can be leaked via update patterns. To address this problem, we design a novel secure outsourced database framework for growing data, DP-Sync, which interoperate with a large class of existing encrypted databases and supports efficient updates while providing differentially-private guarantees for any single update. We demonstrate DP-Sync's practical feasibility in terms of performance and accuracy with extensive empirical evaluations on real world datasets.
more » « less
Full Text Available
Practical Security and Privacy for Database Systems

https://doi.org/https://doi.org/10.1145/3448016.3457544

He, Xi; Rogers, Jennie; Bater, Johes; Machanavajjhala, Ashwin; Wang, Chenghong; Wang, Xiao (July 2021, IGMOD '21: Proceedings of the 2021 International Conference on Management of Data)

Computing technology has enabled massive digital traces of our personal lives to be collected and stored. These datasets play an important role in numerous real-life applications and research analysis, such as contact tracing for COVID 19, but they contain sensitive information about individuals. When managing these datasets, privacy is usually addressed as an afterthought, engineered on top of a database system optimized for performance and usability. This has led to a plethora of unexpected privacy attacks in the news. Specialized privacy-preserving solutions usually require a group of privacy experts and they are not directly transferable to other domains. There is an urgent need for a generally trustworthy database system that offers end-to-end security and privacy guarantees. In this tutorial, we will first describe the security and privacy requirements for database systems in different settings and cover the state-of-the-art tools that achieve these requirements. We will also show challenges in integrating these techniques together and demonstrate the design principles and optimization opportunities for these security and privacy-aware database systems.
more » « less
Full Text Available
SAQE: practical privacy-preserving approximate query processing for data federations

https://doi.org/10.14778/3407790.3407854

Bater, Johes; Park, Yongjoo; He, Xi; Wang, Xiao; Rogers, Jennie (August 2020, Proceedings of the VLDB Endowment)
null (Ed.)
Full Text Available
Poirot: Private Contact Summary Aggregation

https://doi.org/10.1145/3384419.3430603

Zhang, Yanping; Wang, Chenghong; Pujol, David; Bater, Johes; Lentz, Matthew; Machanavajjhala, Ashwin; Nayak, Kartik; Vasudevan, Lavanya; Yang, Jun (November 2020, SenSys '20: Proceedings of the 18th Conference on Embedded Networked Sensor Systems)

Physical distancing between individuals is key to preventing the spread of a disease such as COVID-19. On the one hand, having access to information about physical interactions is critical for decision makers; on the other, this information is sensitive and can be used to track individuals. In this work, we design Poirot, a system to collect aggregate statistics about physical interactions in a privacy-preserving manner. We show a preliminary evaluation of our system that demonstrates the scalability of our approach even while maintaining strong privacy guarantees.
more » « less
Full Text Available

« Prev Next »

Search for: All records